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Abstract 

As cell-invading molecular machinery, coronavirus spike proteins pose an evolutionary 
conundrum due to their high divergence. In this study, we determined the cryo-EM structure 
of avian infectious bronchitis coronavirus (IBV) spike protein from the y-genus. The trimeric 
IBV spike ectodomain contains three receptor-binding SI heads and a trimeric membrane- 
fusion S2 stalk. While IBV S2 is structurally similar to those from the other genera, IBV SI 
possesses structural features that are unique to different other genera, thereby bridging 
these diverse spikes into an evolutionary spectrum. Specifically, among different genera, 
the two domains of SI, the N-terminal domain (SI -NTD) and C-terminal domain (SI -CTD), 
diverge from simpler tertiary structures and quaternary packing to more complex ones, lead¬ 
ing to different functions of the spikes in receptor usage and membrane fusion. Based on 
the above structural and functional comparisons, we propose that the evolutionary spectrum 
of coronavirus spikes follows the order of a-, 5-, y-, and p-genus. This study has provided 
insight into the evolutionary relationships among coronavirus spikes and deepened our 
understanding of their structural and functional diversity. 


Author summary 

Because of their structural and functional diversity, coronavirus spike proteins represent a 
model system for studying viral evolution. Their evolutionary relationships and history 
pose major puzzles in virology. A critical missing piece in understanding the evolution of 
coronavirus spikes is the unavailability of the tertiary structure of y-coronavirus spikes. 
Here we determined the first cryo-EM structure of avian infectious bronchitis coronavirus 
(IBV) spike from the y-genus. The IBV spike contains structural features that are unique 
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to different other genera; hence it brings the spikes from different coronavirus genera into 
one evolutionary spectrum, with itself sitting in the middle of this spectrum. The order of 
the evolutionary spectrum is a-, S-, y-, and (5-genus. Importantly, through comparisons 
between IBV spike and other coronavirus spikes, this study illustrates how coronavirus 
spikes have achieved structural and functional diversity through evolution to guide viral 
entry into host cells. 


Introduction 

As large enveloped RNA viruses, coronaviruses are capable of adapting to new hosts with rela¬ 
tive ease through mutations and recombinations [1-3]. As a result, coronaviruses infect a wide 
range of mammalian and avian species, and have genetically evolved into four major genera: 
a, p, y, and 8 [4]. Coronaviruses from the four genera all contain envelope-anchored spike pro¬ 
teins that mediate viral entry into host cells [5, 6]. During viral entry, the spikes bind to host 
receptors through their SI subunits and then fuse viral and host membranes through their S2 
subunits. On the one hand, the spikes interact with host receptors and other host factors, 
hence needing to evolve for better adaptation to these host factors [7-10]. On the other hand, 
they are exposed to the host immune system, thereby needing to evolve to evade the host 
immune surveillance [11-14]. Consequently, the spikes are the most divergent among all coro¬ 
navirus proteins [6]. The SI subunits are particularly divergent, with little or low sequence 
similarities across different genera [15]. How coronavirus spikes have evolved to their current 
diverse structures imposes a major evolutionary conundrum. 

Traces of protein evolution can often be found more reliably in their tertiary structures and 
related functions than in their primary structures, because proteins generally need to evolve 
within certain structural and functional constraints [16,17]. To decipher the evolutionary puz¬ 
zles surrounding coronavirus spikes, extensive structural studies have been carried out using 
both X-ray crystallography and cryo-electron microscopy (cryo-EM) [12-14, 18-28]. These 
studies have resulted in structure determinations of SI from the a- and P-genera and spike 
ectodomains from the a-, P-, and 8-genera. Coronavirus spikes exist in two distinct conforma¬ 
tions: the pre-fusion structures are present on mature virions and have a clove-like shape with 
three SI heads sitting on top of a trimeric S2 stalk [12-14,18-20]; the post-fusion structures 
are the membrane-fusion state and have a dumbbell-like shape with three S2 subunits forming 
a six-helix bundle structure [29-35]. Whereas the structures of S2 from different genera are 
similar to each other in both the pre- and post-fusion states, the SI subunits from different 
genera diverge structurally and they also recognize a variety of host receptors [6, 7]. SI con¬ 
tains two domains, N-terminal domain (Sl-NTD) and C-terminal domain (Sl-CTD), either 
or both of which can function as the receptor-binding domain. The SI domains from different 
genera contain structural features that are unique to their genus. Sl-CTDs are particularly 
diverse, with low or no structural similarity across different genera [15]. Overall, these previ¬ 
ous studies have provided structural snapshots of coronavirus spikes from a-, P-, and 8-genera. 
However, because the structures of y-coronavirus spikes were still missing, we lacked a clear 
picture of the evolutionary relationships among coronavirus spikes from different genera. 

In the current study, we determined the cryo-EM structure of avian infectious bronchitis 
coronavirus (IBV) spike, the first such structure from the y genus. The IBV spike possesses 
structural features that are unique to different other genera, suggesting that it falls in the mid¬ 
dle of an evolutionary spectrum of coronavirus spikes. We also discuss how the structural evo¬ 
lution of coronavirus spikes has affected their functions as cell-invading molecular machinery. 
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Overall, our study has filled in a critical gap in the structural, functional and evolutionary stud¬ 
ies of coronavirus spikes, and deepened our understanding of viral evolutions in general. 

Materials and methods 

Expression and purification of IBV spike ectodomain 

IBV spike gene (virus strain M41; GenBank number ABI26423.1) was synthesized (Genscript) 
with codons optimized for insect cell expression. Its ectodomain (residues 20-1084) was 
cloned into pFastBac vector (Life Technologies Inc.) with a N-terminal honeybee melittin sig¬ 
nal peptide and C-terminal GCN4 and His 6 tags. It was expressed in Sf9 insect cells using the 
Bac-to-Bac system (Life Technologies Inc.) and purified as previously described [21]. Briefly, 
the protein was harvested from cell culture medium, and purified sequentially on Ni-NTA col¬ 
umn and Superdex200 gel filtration column (GE Healthcare). IBV Sl-CTD (residues 248-495) 
was expressed and purified in the same way as the IBV spike ectodomain, although it only con¬ 
tains a C-terminal His 6 tag and does not contain the GCN4 tag. 

Cryo-electron microscopy 

For sample preparation, aliquots of IBV spike ectodomain (3 pi, 0.35 mg/ml, in buffer contain¬ 
ing 2 mM Tris pH7.2 and 20 mM NaCl) were applied to glow-discharged CF-2/1-4C C-flat 
grids (Protochips). The grids were then plunge-frozen in liquid ethane using a FEI Marklll 
Vitrobot system (FEI Company). 

For data collection, images were recorded using a Gatan K2 Summit direct electron detec¬ 
tor in the direct electron counting mode (Gatan), attached to a FEI Titan-Krios TEM, at Ari¬ 
zona State University. The automated software SerialEM [36] was used to collect -2,000 total 
movies at 37,700x magnification and at a defocus range between 1 and 3 pm. Each movie had a 
total accumulated exposure of 53.66 el A 2 fractionated in 50 frames of 200 ms exposure. Data 
collection statistics are summarized in SI Table. 

For data processing, whole frames in each movie were corrected for beam-induced motion 
and dose compensation using MotionCor2 [37] and -1,400 best images were manually 
selected (we manually discarded micrographs with only carbon field of view or thick ice after 
motion correction as well as micrographs with defocus parameter higher than 4.5 pm after 
CTF estimation). The final image was bin-averaged to lead to a pixel size of 1.02 A. The param¬ 
eters of the microscope contrast transfer function were estimated for each micrograph using 
GCTF [38]. Particles were automatically picked and extracted using RELION [39] with a 
box size of 320 pixels. Initially, -802,000 particles were subjected to 2D alignment and cluster¬ 
ing using RELION, and the best classes were selected for an additional 2D alignment. -5,000 
best particles were applied for creating the initial 3D model using RELION. -170,000 particles 
selected from 2D alignment were then subjected to 3D classification and the best class with 
-100,000 particles were subjected to 3D refinement to generate the final density map. The 
final density map was sharpened with modulation transfer function of K2 operated at 300kV 
using RELION post-processing. Reported resolutions were based on the gold-standard Fourier 
shell correlation (FSC) = 0.143 criterion, and Fourier shell correction curves were corrected 
for the effects of soft masking by high-resolution noise substitution [40]. Data processing sta¬ 
tistics are summarized in SI Table. 

Model building and refinement 

The initial model of IBV spike ectodomain was obtained by fitting the seven parts (Sl-NTD, 
Sl-CTD, two parts of SD1, two parts of SD2, and S2) of the porcine delta coronavirus spike 
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structure (PDB ID: 6B7N) individually into the cryo-EM density map of IBV spike using 
UCSF Chimera [41] and Coot [42]. Manual model rebuilding was carried out using Coot 
based on the well-defined continuous density of the main chain; the side chain assignments 
were guided by the densities of N-linked glycans and bulky amino acid residues. The structural 
model of the IBV spike in the pre-fusion state was refined using Phenix [43] with geometry 
restrains and three-fold noncrystallographic symmetry constraints. Refinement and model 
rebuilding in Coot were carried out iteratively until there were no further improvements in 
geometry parameters and model-map correlation coefficient. The quality of the final model 
was analyzed with MolProbity [44] and EMRinger [45]. The validation statistics of the struc¬ 
tural models are summarized in SI Table. 

IBV pseudovirus entry assay 

IBV pseudovirus entry assay was carried out as previously described [46]. Briefly, full-length IBV 
spike gene was inserted into pcDNA3.1 (+) plasmid. Retroviruses pseudotyped with IBV spike 
and expressing a luciferase reporter gene were prepared through co-transfecting HEK293T cells 
(source: American Type Culture Collection) with a plasmid carrying Env-defective, luciferase- 
expressing HIV-1 genome (pNL4-3.1uc.RE) and the plasmid encoding IBV spike. The produced 
IBV pseudoviruses were harvested 72 hours post transfection, and then used to enter DF-1 cells 
(source: American Type Culture Collection) and HEK293T cells. After incubation for 5 hours at 
37 °C, the medium was changed and cells were incubated for an additional 60 hours. Cells were 
then washed with PBS and lysed. Aliquots of cell lysates were transferred to Optiplate-96 (PerkinEl- 
mer), followed by addition of luciferase substrate. Relative light units (RLUs) were measured using 
EnSpire plate reader (PerkinElmer). All the measurements were carried out in quadruplicates. 

Flow cytometry cell-binding assay 

Recombinant IBV Sl-CTD was assayed for its cell-binding capability using flow cytometry as 
previously described [13]. Briefly, HEK293T and DF-1 cells were incubated with recombinant 
IBV Sl-CTD containing a C-terminal His 6 tag (40 pg/ml) at room temperature for 30 minutes, 
followed by incubation with phycoerythrin (PE)-labeled anti-His 6 antibody for 30 minutes. 

The cells were then analyzed for the binding of IBV Sl-CTD using flow cytometry. 

Calculation of buried surface area of coronavirus Sl-CTDs 

The total surface area and buried surface area of coronavirus Sl-CTDs were calculated using 
the PISA server at the European Bioinformatics Institute (http://www.ebi.ac.uk/pdbe/prot_ 
int/pistart.html) [47]. Specifically, for each trimeric spike protein, a PDB file containing all of 
the six SI domains (including three copies of Sl-CTDs and three copies of Sl-NTDs) was sub¬ 
mitted to the PISA server, and the total surface area and buried surface area for each Sl-CTD 
were calculated. For the spike proteins used for the above analysis, all their Sl-CTDs were in 
the “lysing down” state. The structures of MERS-CoV and HKU1 spikes were not included in 
the above analysis because the former contain at least one Sl-CTD in the “standing up” state 
and the latter contains long stretches of missing residues in its SI domains, both of which 
would interfere with the above analysis. 

Results 

Overall structure of IBV spike 

We constructed the IBV spike ectodomain (from IBV strain M41) in the pre-fusion state by 
replacing its transmembrane anchor and intracellular domain with a C-terminal GCN4 
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trimerization tag, followed by a His 6 tag (Fig 1A). We expressed the protein in insect cells and 
purified the protein to homogeneity. We collected cryo-EM data on IBV spike ectodomain, 
calculated a density map at 3.93A resolution (Fig IB; SI Fig), built an atomic model of the 
structure and refined it (Fig 1C and ID). The final structural model contains all of the residues 
from 21 to 1022 (except residues 702-710) as well as glycans N-linked to 20 sites. Data collec¬ 
tion and model statistics are shown in SI Table. 

The overall structure of IBV spike ectodomain resembles the pre-fusion structures of coro- 
navirus spikes from the a-, (1-, and 8-genera [12-14,18-20]. It has a clove-like shape, with 
three SI heads forming a crown-like structure and sitting on top of a trimeric S2 stalk. Each 
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Fig 1. Overall structure of IBV spike ectodomain in the pre-fusion conformation. (A) Schematic drawing of IBV spike 
ectodomain. SI: receptor-binding subunit. S2: membrane-fusion subunit. GCN4-His 6 : GCN4 trimerization tag followed by His 6 
tag. Sl-NTD: N-terminal domain of SI. Sl-CTD: C-terminal domain of SI. CH: central helix. FP: fusion peptide. HR1 and HR2: 
heptad repeats 1 and 2. Residues in shaded regions (N-terminus, HR2, GCN4 tag, and His 6 tag) were not included in the 
structural model. Question mark indicates that the exact location of FP is uncertain; the range of FP used for making figures is 
consistent with a previous structural study on (3-genus mouse hepatitis coronavirus spike [18]. (B) Cryo-EM density maps of IBV 
spike ectodomain with atomic model fitted in. The maps have a contour of 7.5 a. (C) Cryo-EM structure of IBV spike 
ectodomain in the pre-fusion conformation. Each of the monomeric subunits is colored differently. (D) Structure of a 
monomeric subunit in the pre-fusion conformation. The structural elements are colored in the same way as in panel (A). Dotted 
line indicates residues 702-710 that are missing in the structural model. All structures are viewed from the side. 

https://doi.org/10.1371/journal.ppat.1007009.g001 
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Fig 2. Detailed structure of IBV spike ectodomain. (A) Schematic drawing of IBV SI. SD1: subdomain 1. SD2: subdomain 2. 
SD1’ and SD1”: two parts of SD1. SD2’ and SD2”: two parts of SD2. (B) Structure of monomeric SI. Sl-NTD is colored in 
cyan. Sl-CTD is colored in green. SD1 is colored in magenta. SD2 is colored in orange. * indicates putative sugar-binding site. 
Partial ceiling on top of the Sl-NTD core is labeled. Putative receptor-binding motif loops (RBMs) in Sl-CTD are also labeled. 
(C) Structure of trimeric SI. Three SI subunits are colored differently. (D) Structure of monomeric S2. The structural elements 
are colored in the same way as in Fig 1A. (E) Structure of trimeric S2. Dotted line indicates residues 702-710 that are missing 
in the structural model. The structural elements of subunit A are colored in the same way as in Fig ID. Subunits B and C are 
colored in light purple and light pink, respectively. All structures are viewed from the side. 

https://doi.org/10.1371/journal.ppat.1007009.g002 


monomeric subunit of SI contains two major domains, Sl-NTD and Sl-CTD, and two subdo¬ 
mains, SD1 and SD2 (Fig 2A and 2B). The Sl-CTDs from three different subunits sit on the 
top and center of the spike trimer, whereas the three Sl-NTDs are located on the lower and 
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outer side to Sl-CTDs (Fig 2C). SD1 and SD2 connect SI to S2. The interface of trimeric S2 
contains three central helices; each subunit of S2 contains one (Fig 2D and 2E). Each subunit 
of S2 also contains two heptad repeat regions, HR1 and HR2, and a fusion peptide (FP) (Fig 
2D and 2E). In the post-fusion structure of trimeric S2, three copies of HR1 and three copies 
of HR2 would refold into a six-helix bundle structure, and FP would insert into the target 
membrane [29-35]. As in the structures of other coronavirus spikes, the HR2 region (residues 
1022-1076) in the pre-fusion IBV spike is disordered (Figs 1A and 2D). The exact residue 
range of coronavirus FP remains unknown, although biochemical studies have identified a 
region in coronavirus S2 that associates with membranes and likely corresponds to FP (Fig 
2D) [48, 49]. In the following sections of this paper, we will compare the structures and func¬ 
tions of IBV spike to those of the spikes from the other three genera, and discuss the evolution 
of coronavirus spikes. 

Structural and functional evolution of coronavirus spike Sl-NTDs 

IBV Sl-NTD takes the same galectin fold as the Sl-NTDs from the other three coronavirus 
genera, but it contains unique structural features (Fig 3A 3D). Its core structure is a twelve- 
stranded (3-sandwich, which consists of two six-stranded antiparallel [3-sheet layers stacked 
together through hydrophobic interactions (Figs 2B and 3C). The topology of the [3-sandwich 
core is identical to that of human galectins (S2 Fig). Underneath the core structure is another 
]3-sheet and an a-helix, which are also present in the Sl-NTDs from the other three coronavi¬ 
rus genera. Above the core structure are some loops that form a partial ceiling-like structure. 
This structure is not present in human galectins or Sl-NTDs from a- or 8-genus, but becomes 
a more extensive ceiling-like structure in (3-coronavirus Sl-NTDs (Fig 3A, 3B and 3D). Based 
on the structure and function of [3-coronavirus Sl-NTDs, we previously predicted that 
Sl-NTDs from all of the genera have a galectin fold, and further correlated the galectin fold to 
their functions as viral lectins [15]. Recent structural studies, including the current one, have 
confirmed our previous structural predictions (S2 Fig). These studies also have unexpectedly 
revealed that the presence and extent of the ceiling-like structure on top of the core structure 
are unique structural features for Sl-NTDs from different genera. 

It has been known that IBV spike binds sugar [50]. A recent study further confirmed that 
the sugar-binding domain in IBV spike is its Sl-NTD [51]. To date no structural information 
is available for the complexes of coronavirus Sl-NTDs and their sugar ligand. Mutagenesis 
study showed that in the Sl-NTD from [3-genus bovine coronavirus (BCoV), the sugar-binding 
site is located in the pocket formed between the core structure and the ceiling [24]. In the 
structure of host galectins, despite no ceiling, the sugar-binding site is located in the same 
place [52], Based on the structural similarity between the Sl-NTDs from different coronavirus 
genera, the sugar-binding site in IBV Sl-NTD might also be located in the pocket formed 
between the core structure and the partial ceiling (Figs 2B and 3C). 

The structure determination of IBV Sl-NTD provides insight into the structural and func¬ 
tional evolution of coronavirus Sl-NTDs. We hypothesized that coronavirus Sl-NTDs origi¬ 
nated from host galectins based on the structural similarities between coronavirus Sl-NTDs 
and host galectins [23, 24]. As host proteins, galectins are not recognized by the host immune 
system. In comparison, coronavirus Sl-NTDs are under the host immune pressure to evolve. 
The gradual structural evolution of the ceiling on top of the core structure may have led to 
three functional outcomes. First, the ceiling could provide better protection to the sugar-bind¬ 
ing site from host immune surveillance, which appears to be a common feature of viral lectins 
[53]. This hypothesis on protected sugar-binding sites in viral lectins is also consistent with the 
“canyon hypothesis” which states that receptor-binding sites on viral surfaces are hidden from 
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Fig 3. Structural comparisons of Sl-NTDs from four coronavirus genera. (A) Structure of Sl-NTD from a-genus human 
coronavirus NL63 (PDB ID: 5SZS). Although each subunit of NL63 SI contains two copies of Sl-NTDs (i.e., S1-NTD1 and 
S1-NTD2), S1-NTD2 was used in structural comparisons with the Sl-NTDs from the other genera because it occupies the same 
location as the Sl-NTDs from the other genera in quaternary structures of the spikes (see Fig 5A). (B) Structure of Sl-NTD from 
8-genus porcine delta coronavirus (PdCoV) (PDB ID: 6B7N). (C) Structure of Sl-NTD from y-genus IBV. (D) Structure of 
Sl-NTD from |3-genus SARS coronavirus (PDB ID: 5X58). * indicates sugar-binding site or putative sugar-binding site in sugar- 
binding Sl-NTDs from each genus. Core structure, partial ceiling, and extensive ceiling are labeled. Arrows from panels (A) to 
(D) indicate evolutionary direction. (E) Quantitative structural comparisons among Sl-NTDs from different genera using 
software Dali [58], Both Z-score and r.m.s.d. were calculated for each pair of the proteins. PDB IDs for NL63, PdCoV and SARS 
Sl-NTDs are the same as in panels (A)-(D). PDB IDs for mouse hepatitis coronavirus (MHV) and MERS coronavirus are 3JCL 
and 5X5F, respectively. CEACAMlb (PDB ID: 5VST), whose [1-sandwich fold is topologically different from that of coronavirus 
Sl-NTDs [59], was used as a negative control. N.D.: no detectable structural similarity. 

https://doi.org/10.1371/journal.ppat.1007009.g003 


the host immune surveillance [54]. Second, the ceiling is also involved in the quaternary pack¬ 
ing of SI, which will be discussed later in this paper. Third, in the structure of Sl-NTD from [1- 
genus mouse hepatitis coronavirus (MHV), the outer surface of the ceiling has further evolved 
the capability to bind a protein receptor CEACAM1, while the presumed sugar-binding pocket 
has lost its capability to bind sugar [23]. Hence, the structural development of the ceiling is a 
possible indicator for the evolution of Sl-NTDs in the direction of a- and 8-genera, then the y- 
genus, and finally the (1-genus. Furthermore, we performed quantitative structural compari¬ 
sons of Sl-NTDs from different genera by calculating the Z-score and r.m.s.d. between each 
pair of the proteins (Fig 3E). The result confirmed that Sl-NTDs are relatively conserved 
among different genera, as reflected by the generally high Z-scores and low r.m.s.d. In terms of 
structural distances to a-coronavirus Sl-NTDs, 8-coronavirus Sl-NTDs are the closest, [i- 
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coronavims Sl-NTDs are the farthest, and y-coronavirus Sl-NTDs fall in the middle. More¬ 
over, the structural similarity between a- and 8-coronavirus Sl-NTDs is slightly higher than 
that between two [1-coronavirus Sl-NTDs, suggesting that Sl-NTDs within (1-genus have 
diverged slightly more than those between the a- and 8-genera. Taken together, Sl-NTDs 
from the four genera form an evolutionary spectrum in the order of a-, 8-, y-, and (1-genus, 
with a-coronavirus Sl-NTDs probably being the most ancestral (Fig 3A-3D). 


Structural and functional evolution of coronavirus spike Sl-CTDs 

The structure of IBV Sl-CTD is significantly different from the structures of Sl-CTDs from 
the other genera (Fig 4A-4D; S3 Fig). Its core structure is a (1-sandwich containing two [1-sheet 
layers: one is five-stranded and antiparallel, and the other is two-stranded and parallel (Figs 2B 
and 4C; S3 Fig). The interactions between the two (1-sheet layers are present but minimal. In 
contrast, the core structures of a-coronavirus and 8-coronavirus Sl-NTDs are both standard 
(1-sandwich folds with extensive interactions between the two (1-sheet layers: one is three- 
stranded and antiparallel, and the other is three-stranded and mixed (Fig 4A and 4B; S3 Fig). 
Even more drastically different are the (5-coronavirus Sl-CTDs, which contain only one five- 
stranded antiparallel (5-sheet layer with the other layer turning into an a-helix and a coil (Fig 
4D; S3 Fig). Despite these dramatic structural differences, the Sl-CTDs from all genera share 
the same structural topology (i.e., connectivity of secondary structural elements) (S3 Fig). 
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Fig 4. Structural comparisons of Sl-CTDs from four coronavirus genera. (A)-(D): Structures of Sl-CTDs from different 
genera. Core structures and RBMs are labeled. (E) Quantitative structural comparisons among Sl-CTDs from different genera. 
The PDB IDs are the same as those in Fig 3. Left right arrows from panels (A) to (D) indicate that evolution could go either way. 
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Moreover, the additional structural motifs on the edge of the core structure are also diverse 
among different genera (S3 Fig). In the IBV Sl-CTD, two extended loops on the edge of the 
core structure function as putative receptor-binding motifs (RBMs) by potentially binding to 
an unknown receptor (see below) (Figs 2B and 4C). In both the a- and 8-coronavirus Sl- 
CTDs, three short discontinuous loops are located in the same spatial region; they function as 
the RBMs in a-coronavirus Sl-CTDs and putative RBMs in 8-coronavirus Sl-CTDs (Fig 4A 
and 4B). In P-coronavirus Sl-CTDs, a long continuous subdomain is located in this spatial 
region and functions as the lone RBM (Fig 4D). Structural variations of the RBMs in the 
Sl-CTDs within each of the genera further lead to different receptor specificities [7], In sum, 
IBV Sl-CTD contains a weakened (1-sandwich core structure and two extended RBM loops; 
the former structural feature falls between the (1-sandwich cores of a- and 8-genera and the fi¬ 
shed core of (1-genus, whereas the latter structural feature falls between the three short discon¬ 
tinuous RBM loops of a- and 8-genera and a single long continuous RBM subdomain of P- 
genus. 

To investigate the function of IBV Sl-CTD, we performed two assays to detect possible 
interactions between IBV Sl-CTD and its potential receptor on the host cell surface. First, we 
carried out an IBV-spike-mediated pseudovirus entry assay in the presence of recombinant 
IBV Sl-CTD (S4A Fig). To this end, retroviruses pseudotyped with IBV spike (i.e., IBV pseu¬ 
doviruses) were used to enter host cells. In the absence of recombinant IBV Sl-CTD, IBV 
pseudoviruses entered DF-1 cells (chicken fibroblast) efficiently, which was consistent with a 
previous report showing that DF-1 cells are permissive to live IBV (strain M41) infections 
[55]. As a negative control, their entry into HEK293T cells (human kidney) was inefficient. 
Recombinant IBV Sl-CTD reduced the efficiency of IBV pseudovirus entry into DF-1 cells in 
a dose-dependent manner, likely because it competed with IBV pseudoviruses for an unknown 
receptor on the host cell surface. Second, we examined the binding of recombinant IBV 
Sl-CTD to the host cell surface using a flow cytometry assay (S4B Fig). To this end, recombi¬ 
nant IBV Sl-CTD was incubated with DF-1 cells, and subsequently cell-bound Sl-CTD was 
detected using flow cytometry. Recombinant IBV Sl-CTD bound to the surface of DF-1 cells 
efficiently, but not the surface of HEK293T cells. Taken together, IBV Sl-CTD binds to a yet- 
to-be-identified receptor on the surface of chicken cells and hence functions as a receptor¬ 
binding domain (RBD). Thus, the Sl-NTD and Sl-CTD of IBV spike may both function as 
RBDs. Because coronavirus Sl-CTDs from the a- and P-genera all use the additional structural 
features on the edge of their core structure as their RBMs, it is likely that the two extended 
loops in the same spatial region in IBV Sl-CTD function as the RBMs. 

Coronavirus Sl-CTDs represent remarkable examples of divergent evolution of viral pro¬ 
teins. The core structures and the RBM regions of Sl-CTDs are both divergent among differ¬ 
ent genera (Fig 4A-4D; S3 Fig). The core structures are P-sandwiches for a- and 8-coronavirus 
Sl-CTDs, weakened P-sandwiches for y-coronavirus Sl-CTDs, and single P-sheet layer for P- 
coronavirus Sl-CTDs. The RBMs are three short discontinuous loops for a- and 8-coronavirus 
Sl-CTDs, two reinforced loops for y-coronavirus Sl-CTDs, and a single continuous subdo¬ 
main for P-coronavirus Sl-CTDs. Hence the Sl-CTDs form an evolutionary spectrum, with a- 
and 8-coronavirus Sl-CTDs on one end, P-coronavirus Sl-CTDs on the other end, and y- 
coronavirus Sl-CTDs in between. We performed quantitative structural comparisons of 
Sl-CTDs from all four genera (Fig 4E). The result confirmed that Sl-CTDs are relatively 
poorly conserved among different genera, as reflected by the generally low Z-scores and high 
r.m.s.d. In terms of structural distances to a-coronavirus Sl-CTDs, 8-coronavirus Sl-CTDs 
are the closest, P-coronavirus Sl-CTDs are the farthest, and y-coronavirus Sl-CTDs fall in the 
middle. The functional outcomes of the core structure evolution are not clear, but the evolu¬ 
tion of the RBMs may have led to the following two functional outcomes. First, the diversity of 
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the RBMs from three short loops to two extended loops and then to a long subdomain may 
allow coronaviruses to explore a wider variety of receptors. Second, the reinforced RBM 
regions in both (5- and y-coronavirus Sl-NTDs facilitate quaternary packing of SI, which will 
be discussed later in this paper. Taken together, the Sl-CTDs from different genera form an 
evolutionary spectrum in the order of a-, 8-, y-, and [5-genus, although the evolutionary direc¬ 
tion could go either way (Fig 4A-4D). 


Evolution of quaternary packing of coronavirus SI 

Curiously, coronavirus SI from different genera take two types of quaternary packing modes 
(Fig 5A-5D) [12-14, 18-20]. IBV SI takes a cross-subunit quaternary packing mode where the 
Sl-NTD and Sl-CTD from different subunits pack together (Fig 5C). Specifically, in the tri- 
meric IBV spike, one Sl-CTD packs against two Sl-CTDs from the other subunits as well as 
one Sl-NTD from another subunit. The putative RBMs of IBV Sl-CTD and the partial ceiling 
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Fig 5. Quaternary packing of SI from four coronavirus genera. (A)-(D) Structures of trimeric SI from different genera. 
Three SI subunits are colored differently. The PDB IDs are the same as those in Fig 3. All structures are viewed from the side. 
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of IBV Sl-NTD are both involved in the cross-subunit packing. As a result, the putative RBMs 
of IBV Sl-CTD are partially concealed, disallowing their full access to the host receptor. Hence 
IBV Sl-CTD in the current structure was captured in a “lying down” state, and would need to 
“stand up” on the spike trimer for efficient receptor binding. This potential conformational 
change of IBV SI can minimize the exposure of the putative RBMs in its Sl-CTD to the 
immune system, thereby functioning as a possible strategy for viral immune evasion. [1-coro¬ 
na virus SI also takes the cross-subunit packing mode, with the RBM of its Sl-CTD and the 
ceiling of its Sl-NTD both involved in the cross-subunit packing (Fig 5D) [18-20]. In contrast, 
a- and 8-coronavirus SI both take an intra-subunit packing mode where the Sl-NTD and 
Sl-CTD from the same subunit pack together (Fig 5A and 5B) [12-14]. The RBMs of a- and 
8-coronavirus Sl-CTDs are involved in the intra-subunit packing. Whether SI packs in the 
intra-subunit or cross-subunit mode, the RBMs of Sl-CTDs are concealed or partially con¬ 
cealed in their “lying down” state, and would need to switch to the “standing up” state for 
receptor binding. Overall, P- and y-coronavirus SI both take the cross-subunit quaternary 
packing mode, whereas a- and 8-coronavirus SI both take the intra-subunit quaternary pack¬ 
ing mode. 

We examined whether the quaternary structures of coronavirus SI can lead to functional 
differences of coronavirus spikes. First, in both P- and y-coronavirus spikes, the RBMs of their 
Sl-CTDs and the ceiling of their Sl-NTDs have evolved to facilitate the cross-subunit packing. 
These additional structural features further evolved to gain other functions: the RBMs of 
Sl-CTDs recognize diverse protein receptors, whereas the ceiling of the Sl-NTDs either pro¬ 
tects the sugar-binding site or recognizes a new protein receptor [7]. Second, to investigate the 
structural restrain on Sl-CTDs that may hinder their potential conformational change, we cal¬ 
culated the total and buried surface areas of the Sl-CTD on the spikes from different genera. 
The result did not reveal systematic difference between intra-subunit SI packing and cross¬ 
subunit SI packing in the buried surface area of Sl-CTDs. However, it is worth noting that the 
Sl-CTD from P-genus SARS-CoV has the smallest buried surface area (in both absolute value 
and percentage) (S2 Table). The relative small buried surface area of SARS-CoV Sl-CTD indi¬ 
cates less structural restraint on the Sl-CTD from other parts of the spike SI, possibly allowing 
the Sl-CTD to switch to the “standing up” and receptor-accessible conformation more easily. 
The “standing up” conformation of SARS-CoV Sl-CTD may also weaken the structural 
restraint of SI on S2 (discussed in more detail later), potentially allowing membrane fusion to 
proceed more easily [56], Indeed, frequent “standing up” of SARS-CoV Sl-CTD has been 
observed [19]. Overall, compared to the intra-subunit quaternary packing of a- and 8-corona- 
virus SI, the cross-subunit quaternary packing of P- and y-coronavirus SI may have allowed 
their SI to evolve additional functions in receptor recognition; moreover, the Sl-CTD from P- 
genus SARS-CoV spike has a relatively small buried surface area, which maybe responsible for 
its dynamic receptor-binding conformation. 

Structural and functional evolution of coronavirus spike S2 

The structure and function of IBV S2 are highly similar to those of S2 from the other coronavi¬ 
rus genera. In the pre-fusion structure of IBV S2, HR2 is disordered, whereas HR1 and FP 
each consist of several a-helices and connecting loops (the exact residue range of FP is not 
clear) (Fig 2D). In the post-fusion structure, HR1 would refold into a long a-helix, HR2 would 
refold into a mixture of a-helices and coils, three copies of HR1 and HR2 would pack into a 
six-helix bundle structure, and FP would also refold and insert into the target membrane (S5A 
Fig) [6, 29]. IBV S2 is locked in the pre-fusion state because of the structural restraint from SI. 
Specifically, because of the cross-subunit quaternary packing of trimeric IBV SI, HR1 and FP 
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of IBV S2 are structurally restrained by two Sl-CTDs from the other subunits and SD1 from 
another subunit, respectively (S5B Fig). The structural restraints from SI on S2 can be weak¬ 
ened by the standing up of Sl-CTDs (which allows receptor binding) and can be lifted 
completely upon proteolysis removal of SI. The packing between SI and S2 in IBV spike is the 
same as those in P-coronavirus spikes [18-20]. However, in a- and 8-coronavirus spikes, the 
packing between SI and S2 becomes different due to the intra-subunit quaternary packing of 
their trimeric SI: HR1 and FP are restrained by one Sl-CTD and one SD1 from another sub¬ 
unit, respectively (S5C Fig) [12-14]. Other than the differences in S1/S2 packing, the structural 
and functional similarities of coronavirus S2 from different genera suggest evolutionary con¬ 
servation of coronavirus S2. 


Discussion 

The fast evolutionary rates of viruses, particularly RNA viruses, make it difficult to trace their 
evolutionary history [1-3]. Envelope-anchored coronavirus spike proteins guide viral entry 
into cells; they are the fastest evolving coronavirus proteins due to viral needs to engage diverse 
host receptors, maximize membrane-fusion efficiency, and evade host immune surveillance 
[7-14]. Coronavirus spikes from four different genera are divergent, and their evolutionary 
relationships pose a major puzzle in the virology field [6]. Because viral proteins need to func¬ 
tion under certain structural and functional constraints, evolutionary information of viral pro¬ 
teins can be more reliably found in their tertiary structures and related functions than in their 
primary structures [16, 17]. Although extensive structural studies including both X-ray crystal¬ 
lography and cryo-EM have been done on coronavirus spikes, a critical piece that was still 
missing is the structure of y-coronavirus spikes, preventing a clear understanding of the evolu¬ 
tionary relationships among coronavirus spikes [12-14,18-28]. In this study, we determined 
the cryo-EM structure of IBV spike ectodomain, the first such structure from the y-genus, 
which bridges the divergent structures of coronavirus spikes into an evolutionary spectrum 
and provides insight into the evolutionary relationships among coronavirus spikes. 

Our study compares the structures and functions of coronavirus spikes from the four gen¬ 
era, and illustrates the structural and functional evolution of these proteins. First, coronavirus 
Sl-NTDs from all genera share the same structural fold and possibly evolutionary origins with 
host galectins. From a- and 5- genera to y genus and then to p genus, the Sl-NTDs have 
evolved from simple galectin-fold core structure with an exposed sugar-binding site, to having 
a partial ceiling on top of the core structure, and to having an extensive ceiling to protect the 
sugar-binding site from host immune surveillance (the outer surface of the ceiling in one P- 
coronavirus can even bind to a novel protein receptor). The partial ceiling in y-coronavirus 
Sl-NTDs and the extensive ceiling in P-coronavirus Sl-NTDs are also involved in the quater¬ 
nary packing of SI. Second, coronavirus Sl-CTDs from different genera are very diverse, but 
still form an evolutionary spectrum with a- and P-coronavirus Sl-CTDs at two ends and 8- 
and y-coronavirus Sl-CTDs in the middle. The core structures of Sl-CTDs have diverged 
from P-sandwich to weakened P-sandwich and then to P-sheet, whereas the RBMs have 
diverged from short loops to extended loops and then to a long subdomain. The functional sig¬ 
nificance of the core structure evolution is not clear, but the RBM evolution may allow the 
viruses to expand receptor recognition and also contributes to the quaternary packing of SI. 
Third, from a- and 8- genera to P- and y-genera, the quaternary packing of SI has diverged 
from simple intra-subunit packing to more complex cross-subunit packing. The cross-subunit 
quaternary packing of P- and y-coronavirus SI may have allowed their SI to evolve additional 
functions in receptor recognition. Moreover, the relatively small buried surface area of the 
Sl-CTD from P-genus SARS-CoV may allow the Sl-CTD to be more dynamic for receptor 
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binding. Finally, the S2 from all four genera are structurally and functionally conserved, 
although there are some differences in their S1/S2 packing. Quantitative structural comparisons 
also demonstrate that in terms of structural distances to a-coronavirus SI, 8-coronavirus SI is 
the closest, P-coronavirus SI is the farthest, and y-coronavirus SI is the intermediate. We also 
calculated the phylogenetic tree using the amino acid sequences of 29 coronavirus spikes from 
different genera, and the result showed that in terms of amino acid sequence distances to a- 
coronavirus spikes, 8-coronavirus spike is the closest, P-coronavirus spike is the farthest, and y- 
coronavirus spike is the intermediate (S6 Fig). Taken together, coronavirus spikes from differ¬ 
ent genera form an evolutionary spectrum, with a-coronavirus spikes on one end, followed by 
8-coronavirus spikes and y-coronavirus spikes, and P-coronavirus spikes on the other end. 

Because of their fast evolutionary rates, viruses are perfect model systems for studying evo¬ 
lution. Our study has demonstrated that despite structural divergence among coronavirus 
spikes, particularly in their SI where low or little structural similarities can be detected, we can 
still trace the evolutionary relationships among these viral proteins through detailed analyses 
of their structures and functions. Our study also reveals that coronavirus spikes have evolved 
to remarkable diversity to expand their receptor recognition, facilitate membrane fusion, and 
evade host immune surveillance, while conserving basic membrane-fusion mechanisms. The 
evolutionary approaches that coronaviruses take and the evolutionary edges that they gain are 
good examples of viral evolution and deepen our understanding of evolution in general. 


Supporting information 

51 Table. Data collection and model validation statistics. 

(DOC) 

52 Table. Buried surface area of coronavirus spike Sl-CTDs. 

(DOC) 

51 Fig. Cryo-EM data analysis of IBV spike ectodomain. (A) Representative micrograph of 
frozen-hydrated IBV spike ectodomain particles (top) and representative 2D class averages in 
different orientations (bottom). (B) Gold-standard Fourier shell correlation (FSC) curves. The 
resolution was determined to be 3.93 A. The 0.143 cut-off value is indicated by a horizontal 
red bar. 

(TIF) 

52 Fig. Structural topology of coronavirus Sl-NTDs. (A) Structural topology of the core 
structure of human galectin-3 (PDB ID: 1A3K). (B) Structural topology of the core structures 
of a-, y-, and 8-coronavirus Sl-NTDs. (C) Structural topology of the core structures of [i-coro- 
navirus Sl-NTD. PDB IDs of coronavirus Sl-NTDs are the same as in Fig 3. P-strands are 
shown as arrows. The two layers of the core structures are colored in green and magenta, 
respectively. N* and C' indicate N- and C-terminus, respectively. Numbering of the secondary 
structures only counts secondary structural elements in the core region. 

(TIF) 

53 Fig. Structural topology of coronavirus Sl-CTDs. (A) Structural topology of the core 
structures of a- and 8-coronavirus Sl-CTDs. (B) Structural topology of the core structure ofy- 
coronavirus Sl-CTD. (C) Structural topology of the core structure of P-coronavirus Sl-CTD. 
PDB IDs of coronavirus Sl-CTDs are the same as in Fig 4. P-strands are shown as arrows, a- 
helices are shown as cylinders. Coil is shown as a curled line. The two layers of the core struc¬ 
tures are colored in green and magenta, respectively. Receptor-binding motifs (RBMs) are col¬ 
ored in red and the relative lengths of the RBMs are labeled in parentheses. In both y- and 8- 
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coronavirus Sl-CTDs, the RBMs have not been experimentally identified and thus their func¬ 
tions are putative. N* and C* indicate N- and C-terminus, respectively. Numbering of the sec¬ 
ondary structures only counts secondary structural elements in the core region. 

(TIF) 

54 Fig. Function of IBV Sl-CTD. (A) IBV pseudovirus entry into cells in the presence of 
recombinant IBV Sl-CTD. Entry efficiency was characterized by luciferase activity accompa¬ 
nying entry. RLU: relative light units. Mock: no IBV pseudoviruses were added. Entry: IBV 
pseudovirus entry in the absence of any recombinant IBV Sl-CTD. (B) Flow cytometry assay 
for the binding of recombinant IBV Sl-CTD to the surface of cells. Cell-bound IBV Sl-CTD 
was detected using antibodies recognizing its C-terminal His 6 tag. Cells only or cells plus anti¬ 
body without IBV Sl-CTD were used as negative controls. Statistic analyses were performed 
using two-tailed t-test. Error bars indicate S.E.M. (n = 4). *** P<0.001. ** P<0.01. * P< 0.05. 
N.S.: no statistical significance. 

(TIF) 

55 Fig. Structure and function of IBV S2. (A) Structures of monomeric (5-genus MHV S2 in 
the pre-fusion conformation (left; PDB ID: 3JCL) and post-fusion conformation (right; PDB 
ID: 6B30). Structural elements in monomeric S2 are colored in the same way as in Fig 2D. 
Arrow in the pre-fusion structure indicates the direction in which HR1 would need to extend 
to reach the post-fusion conformation. (B) Packing between SI and S2 in IBV spike. Trimeric 

51 and one monomeric S2 are shown. Structural elements in monomeric S2 are colored in the 
same way as in panel (A). Three SI subunits are colored differently. (C) Packing between SI 
and S2 in porcine delta coronavirus spike (PDB ID: 6B7N). Trimeric SI and one monomeric 

52 are shown. SI and S2 are colored in the same way as in panel (B). All structures are viewed 
from the side. 

(TIF) 

56 Fig. Phylogenetic tree derived from the amino acid sequences of 29 coronavirus spikes. 

The phylogenetic tree was constructed using the neighbor-joining method as previously 
described [57]. Horizontal scale bars represent average numbers of substitutions per amino 
acid position. The GenBank accession numbers of the selected spikes are marked before each 
virus name. 

(TIF) 

Acknowledgments 

Initial cryo-EM images were collected using FEI Tecnai TEMs maintained at the Characteriza¬ 
tion Facility of the University of Minnesota. Final cryo-EM data were collected at the John M. 
Cowley Center for High Resolution Electron Microscopy of Arizona State University. We 
thank Dr. Dewight Williams for helping us prepare grids and collect data. Initial image analy¬ 
sis and computation work were performed using the workstations at the Basic Sciences Com¬ 
puting Laboratory of the University of Minnesota Supercomputing Institute. 

Author Contributions 

Conceptualization: Jian Shang, Yuan Zheng, Wei Zhang, Fang Li. 

Data curation: Jian Shang, Yuan Zheng, Yang Yang, Chang Liu, Qibin Geng, Churning Luo. 

Formal analysis: Jian Shang, Yuan Zheng, Yang Yang, Chang Liu, Qibin Geng, Churning Luo, 
Wei Zhang, Fang Li. 


PLOS Pathogens | https://doi.org/10.1371/journal.ppat.1007009 April 23, 2018 


15/19 




9'PLOS I PATHOGENS 


Structure, function, and evolution of IBV spike protein 


Funding acquisition: Wei Zhang, Fang Li. 

Investigation: Wei Zhang, Fang Li. 

Project administration: Fang Li. 

Resources: Fang Li. 

Supervision: Fang Li. 

Validation: Jian Shang, Yuan Zheng, Yang Yang, Chang Liu, Qibin Geng, Churning Luo, Wei 
Zhang, Fang Li. 

Writing - original draft: Fang Li. 

Writing - review & editing: Jian Shang, Yuan Zheng, Yang Yang, Chang Liu, Qibin Geng, 
Churning Luo, Wei Zhang. 


References 

1. Graham RL, Baric RS. Recombination, reservoirs, and the modular spike: mechanisms of coronavirus 
cross-species transmission. J Virol. 2010; 84(7):3134^t6. Epub 2009/11/13. https://doi.org/10.1128/ 
JVI.01394-09 PMID: 19906932; PubMed Central PMCID: PMC2838128. 

2. Li WH, Wong SK, Li F, Kuhn JH, Huang 1C, Choe H, et al. Animal origins of the severe acute respiratory 
syndrome coronavirus: Insight from ACE2-S-protein interactions. Journal of Virology. 2006; 80 

(9) :4211-9. https://doi.Org/10.1128/JVI.80.9.4211 -4219.2006 ISI:000236980000001. PMID: 16611880 

3. Li F. Receptor recognition and cross-species infections of SARS coronavirus. Antiviral Res. 2013; 100 

(I) :246-54. Epub 2013/09/03. https://doi.Org/10.1016/j.antiviral.2013.08.014 PMID: 23994189. 

4. Gonzaalez JM, Gomez-Puertas P, Cavanagh D, Gorbalenya AE, Enjuanes L. A comparative sequence 
analysis to revise the current taxonomy of the family Coronaviridae. Archives of Virology. 2003; 148 

(II) :2207-35. https://doi.org/10.1007/s00705-003-0162-1 WOS:000186399900009. PMID: 14579179 

5. Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nature 
Reviews Microbiology. 2009; 7(6):439-50. https://doi.org/10.1038/nrmicro2147 
WOS:000266451100012. PMID: 19430490 

6. Li F. Structure, Function, and Evolution of Coronavirus Spike Proteins. Annual review of virology. 2016; 3 
(1):237—61. Epub 2016/09/01. https://doi.Org/10.1146/annurev-virology-110615-042301 PMID: 27578435. 

7. Li F. Receptor recognition mechanisms of coronaviruses: a decade of structural studies. J Virol. 2015; 
89(4):1954-64. Epub 2014/11/28. https://doi.Org/10.1128/JVI.02615-14 PMID: 25428871 ; PubMed 
Central PMCID: PMCPmc4338876. 

8. Belouzard S, Millet JK, Licitra BN, Whittaker GR. Mechanisms of coronavirus cell entry mediated by the 
viral spike protein. Viruses. 2012; 4(6):1011-33. Epub 2012/07/21. https://doi.org/10.3390/v4061011 
PMID: 22816037; PubMed Central PMCID: PMC3397359. 

9. Millet JK, Whittaker GR. Host cell proteases: Critical determinants of coronavirus tropism and pathogen¬ 
esis. Virus Res. 2015; 202:120-34. Epub 2014/12/03. https://doi.Org/10.1016/j.virusres.2014.11.021 
PMID: 25445340; PubMed Central PMCID: PMCPmc4465284. 

10. Heald-Sargent T, Gallagher T. Ready, set, fuse! The coronavirus spike protein and acquisition of fusion 
competence. Viruses. 2012; 4(4):557-80. Epub 2012/05/17. https://doi.org/10.3390/v4040557 PMID: 
22590686; PubMed Central PMCID: PMCPmc3347323. 

11. Yang ZY, Werner HC, Kong WP, Leung K, Traggiai E, Lanzavecchia A, et al. Evasion of antibody neu¬ 
tralization in emerging severe acute respiratory syndrome coronaviruses. Proc Natl Acad Sci USA. 
2005; 102(3):797-801. Epub 2005/01/12. https://doi.org/10.1073/pnas.0409065102 PMID: 15642942; 
PubMed Central PMCID: PMCPMC545557. 

12. Walls AC, Tortorici MA, Frenz B, Snijder J, Li W, Rey FA, et al. Glycan shield and epitope masking of a 
coronavirus spike protein observed by cryo-electron microscopy. Nat Struct Mol Biol. 2016; 23 

(10) :899-905. Epub 2016/09/13. https://doi.org/10.1038/nsmb.3293 PMID: 27617430. 

13. Shang J, Zheng Y, Yang Y, Liu C, Geng Q, Tai W, et al. Cryo-EM structure of porcine delta coronavirus 
spike protein in the pre-fusion state. J Virol. 2017. Epub 2017/10/27. https://d 0 i. 0 rg/l 0.1128/jvi.01556- 
17 PMID: 29070693. 

14. Xiong X, Tortorici MA, Snijder J, YoshiokaC, Walls AC, Li W, et al. Glycan shield and fusion activation 
of a deltacoronavirus spike glycoprotein fine-tuned for enteric infections. J Virol. 2017. Epub 2017/11/ 

03. https://doi.org/10.1128/jvi.01628-17PMID: 29093093. 


PLOS Pathogens | https://doi.org/10.1371/journal.ppat.1007009 April 23, 2018 


16/19 




9'PLOS I PATHOGENS 


Structure, function, and evolution of IBV spike protein 


15. Li F. Evidence for a Common Evolutionary Origin of Coronavirus Spike Protein Receptor-Binding Sub¬ 
units. Journal of Virology. 2012; 86(5):2856-8. https://doi.Org/10.1128/JVI.06882-11 
WOS:000300536800043. PMID: 22205743 

16. Siltberg-Liberles J, Grahnen JA, Liberies DA. The evolution of protein structures and structural ensem¬ 
bles under functional constraint. Genes. 2011; 2(4):748-62. Epub2011/01/01. https://doi.org/10.3390/ 
genes2040748 PMID: 24710290; PubMed Central PMCID: PMCPMC3927589. 

17. Goldstein RA. The structure of protein evolution and the evolution of protein structure. Curr Opin Struct 
Biol. 2008; 18(2):170-7. Epub 2008/03/11. https://doi.Org/10.1016/j.sbi.2008.01.006 PMID: 18328690. 

18. Walls AC, Tortorici MA, Bosch BJ, Frenz B, Rottier PJ, DiMaio F, et al. Cryo-electron microscopy struc¬ 
ture of a coronavirus spike glycoprotein trimer. Nature. 2016; 531 (7592): 114-7. Epub 2016/02/09. 

https://doi.org/10.1038/nature16988 PMID: 26855426. 

19. Yuan Y, Cao D, Zhang Y, Ma J, Qi J, Wang Q, et al. Cryo-EM structures of MERS-CoV and SARS-CoV 
spike glycoproteins reveal the dynamic receptor binding domains. Nature communications. 2017; 
8:15092. Epub 2017/04/11. https://doi.org/10.1038/ncomms15092 PMID: 28393837. 

20. Kirchdoerfer RN, Cottrell CA, Wang N, Pallesen J, Yassine HM, Turner HL, et al. Pre-fusion structure of 
a human coronavirus spike protein. Nature. 2016; 531 (7592):118—21. Epub 2016/03/05. https://doi.org/ 
10.1038/nature 17200 PMID: 26935699. 

21. Li F, Li WH, Farzan M, Harrison SC. Structure of SARS coronavirus spike receptor-binding domain 
complexed with receptor. Science. 2005; 309(5742):1864-8. https://doi.Org/10.1126/science. 1116480 
ISL000231989500052. PMID: 16166518 

22. Wu K, Li W, Peng G, Li F. Crystal structure of NL63 respiratory coronavirus receptor-binding domain 
complexed with its human receptor. Proc Natl Acad Sci USA. 2009; 106(47):19970-4. Epub 2009/11/ 

11. https://doi.org/10.1073/pnas.0908837106 PMID: 19901337; PubMed Central PMCID: 
PMCPmc2785276. 

23. Peng GQ, Sun DW, Rajashankar KR, Qian ZH, Holmes KV, Li F. Crystal structure of mouse coronavirus 
receptor-binding domain complexed with its murine receptor. Proceedings of the National Academy of 
Sciences of the United States of America. 2011; 108(26):10696-701. https://doi.org/10.1073/pnas. 

11 04306108 ISI:000292251000064. PMID: 21670291 

24. Peng GQ, Xu LQ, Lin YL, Chen L, Pasquarella JR, Holmes KV, et al. Crystal Structure of Bovine Coro¬ 
navirus Spike Protein Lectin Domain. Journal of Biological Chemistry. 2012; 287(50):41931-8. https:// 
doi.org/10.1074/jbc.M112.418210 WOS:000312103000033. PMID: 23091051 

25. Raj VS, Mou HH, Smits SL, Dekkers DHW, Muller MA, Dijkman R, et al. Dipeptidyl peptidase 4 is afunc¬ 
tional receptor for the emerging human coronavirus-EMC. Nature. 2013; 495(7440):251-4. https://doi. 
org/10.1038/naturel2005 WOS:000316652300054. PMID: 23486063 

26. Reguera J, Santiago C, Mudgal G, Ordono D, Enjuanes L, Casasnovas JM. Structural bases of corona¬ 
virus attachment to host aminopeptidase N and its inhibition by neutralizing antibodies. PLoS Pathog. 
2012 ; 8 ( 8 ). 

27. Chen Y, Rajashankar KR, Yang Y, Agnihothram SS, Liu C, Lin YL, et al. Crystal structure of the recep¬ 
tor-binding domain from newly emerged Middle East respiratory syndrome coronavirus. J Virol. 2013; 

87(19): 10777-83. Epub 2013/08/02. https://d 0 i. 0 rg/l 0.1128/JVI.01756-13 PMID: 23903833; PubMed 
Central PMCID: PMCPmc3807420. 

28. Lu G, Hu Y, Wang Q, Qi J, Gao F, Li Y, et al. Molecular basis of binding between novel human coronavi¬ 
rus MERS-CoV and its receptor CD26. Nature. 2013; 500(7461 ):227-31. Epub 2013/07/09. https://doi. 
org/10.1038/naturel2328 PMID: 23831647. 

29. Walls AC, Tortorici MA, Snijder J, Xiong X, Bosch BJ, Rey FA, etal. Tectonic conformational changes 
of a coronavirus spike glycoprotein promote membrane fusion. Proc Natl Acad Sci USA. 2017; 114 
(42):11157—62. Epub 2017/10/27. https://doi.org/10.1073/pnas.1708727114 PMID: 29073020; 

PubMed Central PMCID: PMCPMC5651768. 

30. Li F, Berardi M, Li WH, Farzan M, Dormitzer PR, Harrison SC. Conformational states of the severe 
acute respiratory syndrome coronavirus spike protein ectodomain. Journal of Virology. 2006; 80 
(14):6794-800. https://doi.org/10.1128/JVI.02744-05 IS 1:000238770000008. PMID: 16809285 

31. Xu Y, Liu Y, Lou Z, Qin L, Li X, Bai Z, et al. Structural basis for coronavirus-mediated membrane fusion. 
Crystal structure of mouse hepatitis virus spike protein fusion core. J Biol Chem. 2004; 279(29):30514- 
22. Epub 2004/05/05. https://doi.org/10.1074/jbc.M403760200 PMID: 15123674. 

32. Xu YH, Lou ZY, Liu YW, Pang H, Tien P, Gao GF, et al. Crystal structure of severe acute respiratory 
syndrome coronavirus spike protein fusion core. Journal of Biological Chemistry. 2004; 279(47):49414- 
9. https://doi.org/10.1074/jbc.M408782200 WOS:000225098100110. PMID: 15345712 

33. Zheng Q, Deng Y, Liu J, van der Hoek L, Berkhout B, Lu M. Core structure of S2 from the human coro¬ 
navirus NL63 spike Glycoprotein. Biochemistry. 2006; 45(51): 15205-15. https://doi.org/10.1021/ 
bi061686w WQS:000242935600008. PMID: 17176042 


PLOS Pathogens | https://doi.org/10.1371/journal.ppat.1007009 April 23, 2018 


17/19 




9'PLOS I PATHOGENS 


Structure, function, and evolution of IBV spike protein 


34. Lu L, Liu Q, Zhu Y, Chan KH, Qin L, Li Y, et al. Structure-based discovery of Middle East respiratory 
syndrome coronavirus fusion inhibitor. Nature communications. 2014; 5:3067. Epub 2014/01/30. 

https://doi.org/10.1038/ncomms4067 PMID: 24473083. 

35. Duquerroy S, Vigouroux A, Rottier PJ, Rey FA, Bosch BJ. Central ions and lateral asparagine/glutamine 
zippers stabilize the post-fusion hairpin conformation of the SARS coronavirus spike glycoprotein. Virol¬ 
ogy. 2005; 335(2):276-85. Epub 2005/04/21. https://doi.Org/10.1016/j.virol.2005.02.022 PMID: 
15840526. 

36. Mastronarde DN. Automated electron microscope tomography using robust prediction of specimen 
movements. Journal of structural biology. 2005; 152(1 ):36—51. Epub 2005/09/27. https://doi.Org/10. 

1016/j.jsb.2005.07.007 PMID: 16182563. 

37. Li X, Mooney P, Zheng S, Booth CR, Braunfeld MB, Gubbens S, et al. Electron counting and beam- 
induced motion correction enable near-atomic-resolution single-particle cryo-EM. Nature methods. 
2013; 10(6):584-90. Epub 2013/05/07. https://doi.org/10.1038/nmeth.2472 PMID: 23644547; PubMed 
Central PMCID: PMCPMC3684049. 

38. Zhang K. Gctf: Real-time CTF determination and correction. Journal of structural biology. 2016; 193 

(1): 1 —12. Epub 2015/11/26. https://doi.Org/10.1016/j.jsb.2015.11.003 PMID: 26592709; PubMed Cen¬ 
tral PMCID: PMCPMC4711343. 

39. Scheres SH. RELION: implementation of a Bayesian approach to cryo-EM structure determination. 
Journal of structural biology. 2012; 180(3):519-30. Epub 2012/09/25. https://doi.Org/10.1016/j.jsb. 
2012.09.006 PMID: 23000701; PubMed Central PMCID: PMCPMC3690530. 

40. Chen S, McMullan G, Faruqi AR, Murshudov GN, Short JM, Scheres SH, et al. Fligh-resolution noise 
substitution to measure overfitting and validate resolution in 3D structure determination by single parti¬ 
cle electron cryomicroscopy. Ultramicroscopy. 2013; 135:24-35. Epub 2013/07/23. https://doi.org/10. 

1016/j.ultramic.2013.06.004 PMID: 23872039; PubMed Central PMCID: PMCPMC3834153. 

41. Goddard TD, Huang CC, Ferrin TE. Visualizing density maps with UCSF Chimera. J Struct Biol. 2007; 

157(1 ):281—7. https://doi.Org/10.1016/j.jsb.2006.06.010PMID: 16963278. 

42. Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D 
Biol Crystallogr. 2010; 66(Pt 4):486-501. https://d 0 i. 0 rg/l0.1107/S0907444910007493 PMID: 
20383002; PubMed Central PMCID: PMC2852313. 

43. Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: a comprehensive 
Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr. 2010; 
66(Pt 2):213-21. https://doi.Org/10.1107/S0907444909052925 PMID: 20124702; PubMed Central 
PMCID: PMC2815670. 

44. Chen VB, Arendall WB 3rd, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, et al. MolProbity: all¬ 
atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr. 2010; 

66(Pt 1): 12—21. https://doi.Org/10.1107/S0907444909042073 PMID: 20057044; PubMed Central 
PMCID: PMC2803126. 

45. Barad BA, Echols N, Wang RY, Cheng Y, DiMaio F, Adams PD, et al. EMRinger: side chain-directed 
model and map validation for 3D cryo-electron microscopy. Nat Methods. 2015; 12(10):943—6. https:// 
doi.org/10.1038/nmeth.3541 PMID: 26280328; PubMed Central PMCID: PMC4589481. 

46. Yang Y, Du L, Liu C, Wang L, Ma C, Tang J, et al. Receptor usage and cell entry of bat coronavirus 
HKU4 provide insight into bat-to-human transmission of MERS coronavirus. Proc Natl Acad Sci USA. 
2014; 111(34): 12516-21. Epub 2014/08/13. https://doi.org/10.1073/pnas.1405889111 PMID: 
25114257; PubMed Central PMCID: PMCPmc4151778. 

47. Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007; 
372(3):774-97. Epub 2007/08/08. https://doi.Org/10.1016/j.jmb.2007.05.022 PMID: 17681537. 

48. Ou X, Zheng W, Shan Y, Mu Z, Dominguez SR, Holmes KV, et al. Identification of the Fusion Peptide- 
Containing Region in Betacoronavirus Spike Glycoproteins. J Virol. 2016; 90(12):5586-600. Epub 
2016/04/01. https://doi.org/10-1128/JVI.00015-16 PMID: 27030273; PubMed Central PMCID: 
PMCPMC4886789. 

49. Basso LG, Vicente EF, Crusca E Jr., Cilli EM, Costa-Filho AJ. SARS-CoV fusion peptides induce mem¬ 
brane surface ordering and curvature. Scientific reports. 2016; 6:37131. Epub 2016/11/29. https://doi. 
org/10.1038/srep37131 PMID: 27892522; PubMed Central PMCID: PMCPMC5125003. 

50. Wickramasinghe IN, de Vries RP, Grone A, de Haan CA, Verheije MH. Binding of avian coronavirus 
spike proteins to host factors reflects virus tropism and pathogenicity. J Virol. 2011; 85(17):8903-12. 
Epub 2011/06/24. https://doi.org/10.1128/JVI.05112-11 PMID: 21697468; PubMed Central PMCID: 
PMCPMC3165808. 

51. Promkuntod N, van Eijndhoven RE, de Vrieze G, Grone A, Verheije MH. Mapping of the receptor-bind¬ 
ing domain and amino acids critical for attachment in the spike protein of avian coronavirus infectious 


PLOS Pathogens | https://doi.org/10.1371/journal.ppat.1007009 April 23, 2018 


18/19 




9'PLOS I PATHOGENS 


Structure, function, and evolution of IBV spike protein 


bronchitis virus. Virology. 2014; 448:26-32. Epub 2013/12/10. https://doi.Org/10.1016/j.virol.2013.09. 
018 PMID: 24314633. 

52. Seetharaman J, Kanigsberg A, Slaaby R, Leffler H, Barondes SH, Rini JM. X-ray crystal structure of the 
human galectin-3 carbohydrate recognition domain at 2.1 -angstrom resolution. Journal of Biological 
Chemistry. 1998; 273(21):13047-52. ISI:000073768500052. PMID: 9582341 

53. Chen L, Li F. Structural Analysis of the Evolutionary Origins of Influenza Virus Hemagglutinin and Other 
Viral Lectins. Journal of Virology. 2013; 87(7):4118-20. https://doi.org/10.1128/JVI.03476-12 
WOS:000315957100049. PMID: 23365425 

54. Rossmann MG. The canyon hypothesis. Hiding the host cell receptor attachment site on a viral surface 
from immune surveillance. J Biol Chem. 1989; 264(25):14587-90. Epub 1989/09/05. PMID: 2670920. 

55. Kint J, Fernandez-Gutierrez M, Maier HJ, Britton P, Langereis MA, Koumans J, et al. Activation of the 
chicken type I interferon response by infectious bronchitis coronavirus. J Virol. 2015; 89(2): 1156-67. 
Epub 2014/11/08. https://doi.Org/10.1128/JVI.02671-14PMID:25378498; PubMed Central PMCID: 
PMCPMC4300645. 

56. Pallesen J, Wang N, Corbett KS, Wrapp D, Kirchdoerfer RN, Turner HL, et al. Immunogenicity and 
structures of a rationally designed prefusion MERS-CoV spike antigen. Proc Natl Acad Sci U S A. 2017; 
114(35):E7348-e57. Epub 2017/08/16. https://doi.org/10.1073/pnas.1707304114 PMID: 28807998; 
PubMed Central PMCID: PMCPMC5584442. 

57. Lau SK, Li KS, Tsang AK, Lam CS, Ahmed S, Chen H, et al. Genetic characterization of Betacorona- 
virus lineage C viruses in bats reveals marked sequence divergence in the spike protein of pipistrellus 
bat coronavirus HKU5 in Japanese pipistrelle: implications for the origin of the novel Middle East respi¬ 
ratory syndrome coronavirus. J Virol. 2013; 87(15):8638-50. Epub 2013/05/31. https://doi.Org/10.1128/ 
JVI.01055-13 PMID: 23720729; PubMed Central PMCID: PMC3719811. 

58. Holm L, Sander C. Touring protein fold space with Dali/FSSP. Nucleic Acids Research. 1998; 26 
(1 ):316—9. ISI:000071778900076. PMID: 9399863 

59. Peng G, Yang Y, Pasquarella JR, Xu L, Qian Z, Holmes KV, et al. Structural and Molecular Evidence 
Suggesting Coronavirus-driven Evolution of Mouse Receptor. J Biol Chem. 2017; 292(6):2174-81. 
Epub 2016/12/31. https://doi.org/10.1074/jbc.M116.764266 PMID: 28035001 ; PubMed Central PMCID: 
PMCPMC5313091. 


PLOS Pathogens | https://doi.org/10.1371/journal.ppat.1007009 April 23, 2018 


19/19 




